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Abstract 

Given a large edge-weighted network G with k terminal vertices, we wish to compress it 
and store, using little memory, the value of the minimum cut (or equivalently, maximum flow) 
between every bipartition of terminals. One appealing methodology to implement a compression 
of G is to construct a mimicking network: a small network G' with the same k terminals, in 
which the minimum cut value between every bipartition of terminals is the same as in G. This 
notion was introduced by Hagerup, Katajaincn, Nishimura, and Ragde [JCSS '98], who proved 
that such G' of size at most 2 2 always exists. Obviously, by having access to the smaller 
network G", certain computations involving cuts can be carried out much more efficiently. 

We provide several new bounds, which together narrow the previously known gap from 
doubly-exponential to only singly-exponential, both for planar and for general graphs. Our 
first and main result is that every fc-terminal planar network admits a mimicking network G' 
of size 0(k 2 2 2k ), which is moreover a minor of G. On the other hand, some planar networks 
G require \E(G')\ > £l(k 2 ). For general networks, we show that certain bipartite graphs only 
admit mimicking networks of size |V^(G")| > 2 n ^ k \ and moreover, every data structure that 
stores the minimum cut value between all bipartitions of the terminals must use 2 n ( fc ) machine 
words. 



"This work was supported in part by The Israel Science Foundation (grant #452/08), by a US-Israel BSF grant 
#2010418, and by the Citi Foundation. 



1 Introduction 



These days, more than ever, we deal with huge graphs such as social networks, communication 
networks, roadmaps and so forth. But even when our main interest is only in a small portion 
of the input graph G, we still need to process all or most of it in order to answer our query, 
since the runtime and memory requirements of many common graph algorithms depend on the 
input (graph) size. Therefore, a natural question is whether we can find a smaller graph G' that 
exactly (or approximately) preserves some property of the original graph such as distances, cuts 
and connectivity. This basic concept is known as a graph compression and was first introduced 
by Feder and Motwani |FM95], although their definition was slightly different technically. They 
require that the compressed graph has fewer edges than the original graph, and that each graph 
can be quickly computed from the other one. They have demonstrated how this paradigm leads to 
significantly improved running time by implementing it for several graph problems. 

Yet another significant advantage of the compressed graph G' is that it requires far less memory 
then storing the original graph G, which could be critical for machines with limited resources such as 
smartphones, assuming that the preprocessing can be executed in advance on much more powerful 
machines. This paradigm becomes indispensable when computations on the compressed graph are 
to be preformed repeatedly (after a one-time preprocessing). 

We focus on cuts and flows, which are of fundamental importance in computer science, engineer- 
ing, and operations research, because of their frequent usage in many application areas. Specifically, 
we study the compression of a large graph G containing k "important" vertices (called terminals), 
into a smaller graph G' containing the same terminals, while maintaining the following condition: 
the minimum cut between every bipartition of the terminals has exactly the same value in G and 
in G' . The above cut condition can be also stated in terms of maximum flow, because it effectively 
deals with the single-source single-sink case, for which we have the max-flow min-cut theorem. We 
now turn to define this problem more formally, restricting our attention (throughout) to undirected 
graphs. 

A network (G,c) is a graph G with an edge-costs function c : E(G) — >• M + . The size of a 
network is its number of vertices of G. The network is called a k-terminal network if the graph G 
has k distinguished vertices called terminals, denoted Q = {qi, . . . , qt} C V(G). In such a network, 
a cut (W, V(G) \ W) is said to be S-separating if it separates the terminals subset S C Q from 
the remaining terminals S := Q \ S , i.e. if W D Q is either S or S. When clear from the context, 
(W, V{G) \ W) may refer not only to a bipartition of the vertices, but also to its corresponding 
cutset (set of edges crossing the cut). The cost of a cut (W, V(G) \ W) is the sum of costs of all 
the edges in the cutset. We let mincutc jC (5, S) denote an S-separating cut in the network (G, c) of 
minimum cost (breaking ties arbitrarily). With a slight abuse of notation, we use the same notation 
to denote also the cost of the that cut. We also omit the subscript c when clear from the context. 

Definition 1.1 (Mimicking Network [HKNR98]). Let (G,c) be a k-terminal network. A mimicking 
network of (G,c) is a k-terminal network (G',c') with the same set of terminals Q, such that for 
all S C Qfl 

mincutG' jC '(S, S) = mincutG, c (5' ) S). 

The above definition (albeit for directed networks) was introduced by Hagerup, Katajainen, 
Nishimura, and Ragde [HKNR98] , who proved that every fe-terminal network [G, c) admits a mim- 

1 Throughout, we omit the trivial exclusion S 7^ 0, Q. 
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Table 1 : Known bounds for the size of mimicking networks 

icking network of size at most 2 . Subsequently, Chaudhuri, Subrahmanyam, Wagner, and Zaro- 
liagis [CSWZOO] studied specific graph families, showing an improved upper bound 0(k) for graphs 
G that have bounded treewidth. For the special case of outerplanar graphs G, the mimicking 
network G 1 they construct is furthermore outerplanar. Some of these previous results hold also for 
directed networks. 

The only lower bound we are aware of on the size of mimicking networks is k + 1 for every 
k > 3, even for a star graph, due to [CSWZOOJ. For k = 4, 5 they further show a matching upper 
bound. These results are summarized in Table [TJ We mention that several other variants of the 
problem were studied in the literature, in particular when cut values are preserved approximately, 
see Section H~2l for details. 



1.1 Our Results 

(a) Upper bounds. We first prove (in Section^) a new upper bound for planar graphs, which 
significantly improves over the bound that follows from previous work (namely, 2 known for 
general graphs [HKNR98]). See also Table [I] for the known bounds. 

Theorem 1.2. Every planar k-terminal network (G, c) admits a mimicking network of size at most 
0{k 2 2 2k ), which is furthermore a minor of G. 

Notice that our theorem constructs for an input graph G a mimicking network that is actually 
a minor of it, and thus preserves additional properties of G such as planarity. 

(b) Lower bounds. We further provide (in Section [3]) two nontrivial lower bounds. See Table [l] 
for comparison with the known bounds. The following theorem addresses general graphs, and 
narrows the previous doubly-exponential gap (between k + 1 and 2 2 ) to be only singly-exponential. 

Theorem 1.3. For every k > 5 there exists a k-terminal network such that every mimicking 
network of it has size 2^ k \ This holds even for bipartite networks with all the terminals on one 
side and all the non-terminals on the other side. 

The next theorem is for mimicking networks of planar graphs, proving a lower bound on the 
number of edges. If the mimicking network is guaranteed to be sparse (say planar, as is the case 



in our bound in Theorem 1.2) then we get a similar bound for the number of vertices. But if the 
mimicking network could be arbitrary (e.g., a complete graph) we do not know how to prove it 
cannot have 0(k) vertices. 

Theorem 1.4. For every k > 5 there exists a planar k-terminal network such that every mimicking 
network of it has at least Q,(k 2 ) edges. 
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Remark. Very recently, we were informed of new results, obtained independently of ours, by 
Khan, Raghavendra, Tetali and Vegh jKRTV12] . Their results include improved upper bounds for 
general graphs (albeit still doubly-exponential in k), for trees, and for bounded treewidth graphs, 
as well as lower bounds that are comparable to ours. 



(c) Succinct data structures. Our final result is an alternative formulation of graph compres- 
sion as the problem of storing succinctly (i.e., summarizing or sketching) all the 2 k terminal cuts 
in a ^-terminal network. 

Definition 1.5. A terminal-cuts (TC) scheme is a data structure that uses storage (memory) M 
to support the following two operations on a k-terminal network (G,c), where n = |V(G)| and 
c: E(G) -> {l,...,n°M}. 

1. Preprocessing P, which gets as input the network and builds M . 

2. Query Q, which gets as input a subset of terminals S, and uses M (without access to (G, c) ) 
to output mincutG jC (5, S). 

Observe that putting together the two conditions above gives Q(S; P(G)) = mmcutG, c (S, S) 
for all S C Q. The storage requirement (or space complexity) of the TC scheme is the (maximum) 
number of machine words used by M. Since the value of every cut in (G,c) is at most n°^ l \ and 
since we need to be able to represent every vertex in G, we shall count the size of the TC scheme in 
terms of machine words of 0(log n) bits. An obvious upper bound is 2 k machine words, by explictly 
storing a list of all the cut values. Perhaps surprisingly, we can show a matching lower bound for 



any data structure using the technology developed to prove Theorem 1.3 We prove the following 



Theorem 1.6 including an extension of it to randomized schemes, in Section |4| 



Theorem 1.6. For every k > 5, a terminal-cuts scheme for k-terminal networks requires storage 
of 2^ k > machine words. 



1.2 Related Work 

Graph compression can be interpreted quite broadly, and indeed it was studied extensively in the 
past, with many results known for different graphical features (the properties we wish to preserve). 
For instance, in the context of preserving the graph distances, concepts such as spanners [PS89j 
and probabilistic embedding into trees AKPW95, Bar96j, have developed into a rich area with 
productive area, and variations of it that involve a subset of terminal vertices were studied more 
recently, see e.g. [CE061 IKZT2] . 

In the context of preserving cuts (and flows), which is also our theme, the problem of graph 
sparsification [BK96] has recently seen an immense progress, see [BSS09J and references therein. 
Even closer to our own work are analogous questions that involve a subset of terminals, and the goal 
is to find a small network that preserves (the cost of) all minimum terminal cuts approximately. 
In particular, Chuzhoy [Chul2| recently showed a constant factor approximation using a network 
whose size depends on (certain) edge-costs is in the original graph. Another variation of our 
problem is that of a cut (and flow) sparsifier, in which the compressed network should contain only 
k vertices (the terminals) and the goal is to minimize the approximation factor (called congestion), 
see [CLLM101 lEGK+lOi IMMlOj for the latest results. 
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2 Upper Bound for Planar Graphs 



In this section we prove Theorem |1.2[ showing that every planar fc-terminal network (G,c) admits 
a mimicking network of size 0(k 2 2 2k ), which is in fact a minor of G. 



2.1 Technical Outline 

Let G be a planar /c-terminal network, and assume it is connected. Let Eg = mincut(y c (iS', S) be 
the cutset of a minimum S'-separating cut in (G, c), and let E be the union of Eg over all subsets 
S C Q. Removing the edges E from the graph G disconnects it to some number of connected 
components, and we construct our mimicking network G' by contracting every such connected 
component into a single vertex. It is easy to verify that these contractions maintain the minimum 
terminal cuts. This method of constructing G' resembles the one in [HKNR98J, except that the 
sets of vertices that we unite are always connected, hence our G' is a minor of G. We proceed to 
bound the number of connected components one gets in this way, as this will clearly be the size of 
our mimicking network G' . 

We first consider removing from G a single cutset mincutc(5, S) (for arbitrary S C Q), and 



show (in Lemma 2.1) that it can disconnect the graph into at most k connected components. We 



then extend this result to removing from G two cutsets, namely mincutc(S', S) and mincutc(T, T) 



(for arbitrary S, T C Q), and show (in Lemma 2.2) such a removal can disconnect the graph into 
at most 3k connected components. Next, we consider removing all the m = 2 k ~ 1 — 1 cutsets of the 
minimum terminal cuts from G (i.e., E) from G. However, naive counting of the number of resulting 
connected components, which argues that every additional cutset splits each existing component 
into at most 0{k) components, would give us in total a poor bound of roughly k m . 

The crucial step here is to use the planarity of G to improve the dependence on m significantly, 
and we indeed obtain a bound that is quadratic in m by employing the dual graph of G denoted G*. 
Loosely speaking, the cutsets in G correspond to (multiple) cycles in G* , and thus we consider the 
dual edges of E, which may be viewed as a subgraph of G* comprising of (many) cycles. We now 
use Euler's formula and the special structure of this subgraph of cycles; more specifically, we count 
its vertices of degree > 2, which turns out to require the aforementioned bound of 3 A; for two sets 



of terminals S, T. This gives us a bound on the number of faces in this subgraph (in Lemma 2.6), 



which in turn is exactly the number of connected component in the primal graph (Corollary 2.7) 



2.2 Preliminaries 

Recall that a graph is called a multi-graph if we allow it to have parallel edges and loops. A cycle 
in a multi-graph G is a sequence of edges (no, vq), . . . , (u;_i, i^-i) such that Vi = U( i+1 ^ mod ; for 
all i = 0, — 1. The cycle is simple if it contains / distinct vertices and I distinct edges. Note 
that two parallel edges define a simple cycle of length 2, and that a loop is a cycle of length 1 that 
contributes 2 to the degree of its vertex. A circuit is a collection of cycles (not necessarily disjoint) 
C = {Ci, . . . , Ci}. Let £(C) = Ui=i C« be the set of edges that participate in one or more cycles 
in the collection (note it is not a multiset, so we discard multiplicities). The cost of a circuit C is 
defined as X^ e e£(c) c ( e )- 

For a graph G, let CC{G) denote the set of connected components in the graph. In particular, 
if CC(G) = {Pi, . . . , Ph} then V(G) = P\ U • • • U Ph as a disjoint union. For a subset of the vertices 
W C V(G), let 5{W) denote the set of edges with exactly one endpoint in W, i.e. 5{W) = {(n, v) G 
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E(G) : u G W,v ^ VF}. A vertex in G with degree more than 2 will be called a meeting vertex of 
G. We introduce special notation for two (disjoint) sets of vertices: 



V m (G) = {v £ V : deg(v) > 2}; V 2 (G) = {v G V : deg(u) = 2}; 

and for two (disjoint) sets of edges: 

E 2 (G) := {(u,v) G £(G) : «,« G F 2 (G)}; 
£ m (G) := {(«,«) G -E(G) : u G F m (G) or u G F m (G)}. 

2.3 Proof of Theorem PI 

Let (G, c) be a fc-terminal network with terminals Q = {q%, . . . , where G is a connected plane 
graph with faces F (if G is not connected we can apply the proof for every connected component 
separately). We may assume, using small perturbation on the edges cost, that every two different 
subsets of edges in G have different total cost. In the proof we will use the notations Eg and E 
defined in Section [2. 11 



Lemma 2.1 (One cutset). For every subset of terminals S, the graph G\Eg has at most k connected 
components. 

Proof. If there are more than k connected components then there is at least one connected com- 
ponent without any terminal vertex. Since G connected, we can unite it to any other connected 
component by removing some edge from Eg. We get a new cutset that separates S from S with 
smaller total cost than Es in contradiction to the minimality. □ 

Lemma 2.2 (Two cutsets). For every two subsets of terminals S and T , the graph G \ (Eg U Er) 
has at most \CC{G \ Eg)\ + \CC{G\Et)\ +k connected components. 

We illustrate this lemma in Figure [lj The idea is that if G \ [Eg U E T ) has too many connected 
components, then we can find one that contains no terminals, and that moving it to the other side 
of (say) G \ Eg contradicts the minimality of Eg. 



Proof ofLemma\K^ Let CC(G \ (Eg U E T )) = {P , . . .,P h }. For every P h we let Wg(Pi) := 
5 (Pi) n Eg be the set of edges in Eg that have exactly one of their endpoints in Pi, and similarly 
Wt(Pi) '■= S(Pi) R Et- We can use the above notation to associate every connected components Pi 
of G \ (Eg U Et), to one of the following four sets: 



G\E S 5 = {ft} C\E T S = { qi ,q 2 } C\(E S UE T ) 






Figure 1: As depicted in gray, G \ Eg has two connected components, G \ Et has three, and 
G \ (Eg U Et) has five. Notice the connected component Vi of G \ (Eg U Et) contains no terminals. 
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G\E' T S = { qi) q 2 } 




Figure 2: E' T = (Et U Ws) \ Wt, where the red edges we removed are Wt, and the blue edges we 
add are Ws- 

1. Ws(Pi) = 0; in particular, P G CG(G \ P T ). 

2. W T (P) = 0; in particular, P G CG(G \ P s ). 

3. Ws(Pi) = W T {Pi); in particular P G CC(G \ P 5 ) n GC(G \ P T ). 

4. Ws(Pi) ^ 0, W-r(P) / and W s {Pi) + W T (Pi); in particular P £ GG(G\P 5 )UGG(G\P T ). 

Every connected component that belongs to the last set (i.e. there are at least two different 
edges in 5(Pi), one from Et and one from Es) will be called a mixed connected component of 
G \ (Es U Et)- Thus, the number of connected components in G \ (Es U Et) is bounded by 
|GG(G \ Eg)\ + |GG(G \ E$)\ plus the number of mixed connected components of G \ (Es U Et)- 

Assume towards contradiction that there are more than k mixed connected components in 
G \ (EsU Et)- Therefore, there exists at least one mixed connected component, say with out loss of 
generality Po, without any terminal in it. Since Pq is a mixed connected component in G\(EsL)Et) 
we know that Ws(Po) 7^ 0, Wt(Po) 7^ and Ws{Po) 7^ Wt(Pq)- For simplicity from now on we will 
drop the po and refer Ws and Wt to Ws(Pq) and Wt{Pq) correspondingly. By the perturbation 
on the edges cost the total cost of these two subsets must be different. Assume without loss of 
generality that c(Ws) < c(Wt)- We will replace the edges Wt by the edges Ws in the cutset of 
T and call this new set of edges E' T , i.e. E' T = (Et U Ws) \ Wt- It is clear that c(E' T ) < c(Et)- 
We will prove that E' T is also a cutset that separate T from T in the graph G, contradicting the 
definition of Et- See Figures [T] and [2] 

Denote CC(G\Et) = {Pq, • • • , P' h /} and assume without loss of generality that the set of edges 
Ws connects the connected component Po and the t connected components Pi, ... Pt of G\(EsL)Et) 
into one connected component Pq in G \ Et- Therefore, by adding the edges Es \ (Ws U Wt) to 
the graph G \ (Es U Et) We will get the graph G' = G \ (Et U Ws) and its connected components 
will be Po, Pi j . . . , Pt, P'x-, ■ ■ ■ , P' h i- Since the graph G' do not contains any edge from Et-, the sets 
T and T are still separated. 

Now it remain to add the edges Wt to the graph G' in order to get the desirable graph G \ E' T . 
Assume without loss of generality that Pq contains terminals from T. Then, by the minimality of 
Et, if edges from Wt connect between Pq and P-, then the terminals of P- are from T. In particular, 
adding the edges Wt to G' will connect Pq to some connected components P[ that contains only 
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terminals from T. Since Po does not contains any terminals, the connected component that was 
combined by the edges Wt contains only terminals from T, and so E' T separate between T and 
f. ' □ 

Planar duality. Recall that every planar graph G has a dual graph G* , whose vertices correspond 
to the faces of G, and whose faces correspond to the vertices of G, i.e., V(G*) = {vj : f G F(G)} 
and F(G*) = {f* : v G V(G)}. Every edge e = (v,u) G E(G) with cost c(e) that lies on the 
boundary of two faces /i,/2 G F(G) has a dual edge e* = (t^,^) G E(G*) with the same cost 
c(e*) = c(e) that lies on the boundary of the faces /* and /*. For every subset of edges H C E(G), 
let H* := {e* : e G i?} denote the subset of the corresponding dual edges in G* . 

The following theorem describes the duality between two different kinds of edge sets - minimum 
cuts and minimum circuits - in a plane multi-graph. It is a straightforward generalization of the 
case of si-cuts (which are dual to cycles) to three or more terminals. We are not aware of a reference 
for this precise statement, although it is similar to |HS85j lRao87] . See also Figure [3] (in page [7]) for 
illustration. 

Theorem 2.3 (Duality between cutsets and circuits). Let G be a connected plane multi-graph, let 
G* be its dual graph, and fix a subset of the vertices W C V(G). Then, H C E{G) is a cutset in 
G that has minimum cost among those separating W from V(G) \W if and only if the dual set 
of edges H* C E(G*) is actually £(C) for a circuit C in G* that has minimum cost among those 
separating the corresponding faces {/* : v G W} from {/* : v G V(G) \ W}. 



c(e*) =8 




Figure 3: A planar 3-terminal network G (in black), with E$ depicted as dashed edges. The dual 
graph G* is shown in blue, with E* s depicted as dashed edges. 

Recall that removing edges from a graph G disconnects it into (one or more) connected com- 
ponents. The next lemma characterizes this behavior in terms of the dual graph G*. Recall that 
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G[H] is a standard notation for the subgraph of G induced by the subset (of edges or vertices) H. 

Lemma 2.4 (The dual of a connected component). Let G be a connected plane multi-graph, let 
G* be its dual, and fix a subset of edges H C E(G). Then P is a connected component in G\H if 
and only if its dual set of faces {/* : v £ P} is a face of G*[H*]. 



Leveraging the planarity. We proceed with the proof of Theorem |1.2[ and now use the duality 
of planar graphs. In the following corollary we will deal with the dual graph G*[E S U E^] for 
arbitrary two subsets of terminals S and T. 

Corollary 2.5. For all S,T C Q, the graph G*[E S U Ej\ has at most 6k meeting vertices. 



Proof of Corollary |j?.5[ According Lemmas 2.1 and 2.2 , the graph G \ (E$ U Et) has at most 



\CC{G \ Es)\ + \CC{G \ Et)\ + k < 3k connected components. By Lemma [2~4| every connected 
component in G \ (E$ U Et) corresponds to a face in G* [Eg U Ej\. Therefore, G* [E% U Ej\ has at 
most 3k faces. 

By the duality of cuts and circuits, every set of edges E% is a circuit. Therefore, every vertex v 
appearing in these edges E s , has degree at least 2. E% U E^ is circuit as well, and all its vertices 
have degree at least 2, i.e. V(G*[E* S \J E* T ]) = V 2 (G*[E* S U E* T ]) U V m (G*[E* s UE^}). To simplify 
the notation we denote G* ST = G* [E s U E^] . By Handshaking lemma, 

2\E(G ST )\ = Y, de sW ^ 3 \ v m(G* ST )\ + m(G ST )\ = 2\V{G ST )\ + \V m {G* ST )[. 

veV(G" ST ) 

By Euler's formula 

3A; > \F(G* ST )\ = \E(G* ST )\ - \V(G ST )\ + \CC(G* ST )\ + 1 > \[V m {G% T )[, 
and the corollary follows. □ 
Recall that in Section 



2.1 



we defined E := U^cQ and denote its set of dual edges by 



E*:={e*: e e E} = \J ScQ E* s . 

Lemma 2.6. The graph G*[E*] has at most 0(k 2 2 2k ) faces. 



Proof of Lemma \2.6[ Using Theorem 2.3 we get that for every S C Q, Es is a minimum cutset in 
G if and only if E$ (the dual set of edges of Es) is a minimum circuit in G* . Moreover, as defined 
in Section 



2.3 E* = UscQ Eg. Thus, E* is also a circuit, and so 

\V(G*[E*})\ = \V 2 (G*[E*})\ + \V m (G*[E*])\, (1) 
\E(G*[E*])\ = \E 2 (G* [E*})\ + \E m (G*[E*})\. (2) 

According the definitions and the Handshaking lemma we get that 

\E 2 (G*[E*])\ < \V 2 (G*[E*)\. (3) 
By a union bound, the two following inequalities holds 

\V m (G*[E*})\ <Esc Q \ V (G*[E s ])nV m (G*[E*})\, (4) 
\E m (G*[E*])\ <Esc Q \ E ( G *i E *s])^ E m(G*[E*])\. (5) 
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Fix a subset S. We will start by bounding the set of vertices V(G*[E* S ]) n V m (G*[E*]). Fore 
every vertex v in V(G*[E* s })nV m (G*[E*]) there exists a subset T such that v is also in V(G*[E s ])n 



V m (G*[E* U ££]). According to Corollary [2^| \V m (G*[E* U E%])\ < 6k. Therefore \V(G*[E*]) n 
F m (G*[^U^])| < 6k, and by union bound on all the subsets T we get \V(G*[E* s ])nV m {G*[E*])\ < 
6k2 k . 



We will now move to bound E(G*[E s ])PiE m (G*[E*]). By Lemma 2.1 there are at most k cycles 
that cover the graph G*[E S ], so every vertex in V(G*[E S ]) n V m (G*[E*]) can be shared by at most 
k cycles of G*[E S ], which bound the degree of every vertex in G*[E S ] by 2k. Thus 

\E(G*[E* S }) n E m (G* [E*])\ < 2k\V(G*[E* s }) n V m (G* [E*})\ = 0{k 2 2 k ) (6) 



We can bound |CC(G*[-E*])| by extending the argument in Lemma 2.1 Assume toward con- 
tradiction that \CC(G* [E*])\ > k + 1. Thus, there exists at least one connected component P in 
G*[E*\ that does not contains any terminal face of G* . By the construction of E* , there exists a 
subset S such that P contains at least one cycle C of the circuit Eg. Since P does not contain any 
terminal face, we can remove some edge e* of the cycle C from the circuit Eg and get circuit with 
smaller cost that separates between f s and f% in contradiction. 

Now by Euler's formula, 

\F(G* [E*])\ =\E(G*[E*])\ - \V(G*[E*])\ + 1 + \CC{G*[E*])\ 

<\E m (G*[E*])\-\V m (G*[E*])\ + l + k byEqns. Q ,(§,(§ 

< EscQ 0(k 2 2 k ) + l + k = 0(k 2 2 2k ), by Eqns. (§,(§ 

and the lemma follows. □ 

Corollary 2.7. There are at most 0(k 2 2 2k ) connected components in the graph G\ E. 



This corollary foll ows from Lemma 2.6 by applying Lemma 2.4 with H = E. We now complete 
the proof of Theorem 1.2 Merge the vertices in each connected component of G \ E into a single 
vertex (formally, contract all the internal edges in each connected component) and call this new 
multi- graph M. Notice there is at most one terminal vertex in each connected component. So a 
vertex in M, which corresponds to a connected component (of G\E) that contains some terminal 
vertex q, will be identified with that terminal q. To be concrete, the vertices and the terminals of 
M are the sets 

V(M) := { Vi : P t G CC(G \ E)} 

Q(M) := {q = v, t : Pi € CC(G \ E) and q G PJ 

In addition, (vi,Vj) is an edge in M if there exist two vertices Ui,uj G E{G) such that m G Pi, 
Uj G Pj and (uj, Uj) is an edge in G. The cost of every edge (vi, Vj) G E(M) is 

c'(vi,Vj) := ^2 c(ui,Uj). 

UiePi,UjePj: (ui, Uj )eE(G) 

It is easy to verify that M is a minor of G with 0{k 2 2 2k ) vertices that includes the same k 
terminals Q. We now prove that (M, c') is a mimicking network of G using the same argument 
as in [HKNR98 , but applied to the connected components. Fix a subset of terminals S. Since 
we only contract edges, every cut that separates between S and S in M has a cut in G that 
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separates between S and S with the same cost, thus mincutjvf jC /(S', S) > mincutG, c ('S', 5). In the 
other direction, notice that by the construction of M, all the vertices in each connected components 
of G \ E are on the same side of the minimum S*-separating cut in G. Thus, there is a cut in M 
that separates between S and S and has cost mincutG , (»S', S). Combining these together, we get the 
equality mmcutM,c'(S, S) = mincutG, c (S'> S) for every S, and Theorem 



1.2 



follows. 



3 Lower Bounds 



In this section we prove Theorems 1.3 as well as Theorem 1.4 



3.1 Techniques and Proof Outline 

All our lower bounds are proved using the same technique, which basically counts the number of 
"degrees of freedom" needed to express all the relevant cut values. Formally, we develop a certain 
machinery based on linear algebra, which relates the size of any mimicking network to the rank of 
some matrix. 

The lower bound proofs start by describing a fc-terminal network {G, c) that seems minimal in 



the sense that it does not admit a smaller mimicking network. The networks used in Theorems 1.3 



and 1.4 are different, see Section [3] for details. We then identify the minimum cost 5-separating 



cuts for all (or some) S C Q, and capture this information in a matrix. 

Definition 3.1 (Incidence matrix between cutsets and edges). Let (G,c) be a k-terminal network, 
and fix an enumeration Si, ... , Sm of all m = 2 k ~ 1 — 1 distinct and nontrivial bipartitions Q = 
SiD Si. The cutset-edge incidence matrix of (G, c) is the matrix Aq, c £ {0, l} mxE ( G ) given by 



(A 



G,c)i,e 



1 i/eg mincut( G, c)(Si, Si); 
otherwise. 



We also define the vector of minimum-cut values between every bipartition of terminals 

/ mincutG, c (5'i, 5i) 



G,c 



\ mincut GiC (S' m ,S' r) 



Here and throughout, we shall omit the subscript c when it is clear from the context. Observe that 
if we think of the edge costs c as a column vector c G (R + ) E ( G ), then Aq ■ c = &g- For a given 
S C Q, a minimum 5-separating cut (W, V(G) \ W) is called unique if all other 5-separating cuts 
have a strictly larger cost. 

The core of our analysis is the next lemma, as it immediately provides a lower bound on the 
size of any mimicking networks; the theorems would follow by calculating the rank of Aq. 

Lemma 3.2 (Main Technical Lemma). Let (G,c) be a k-terminal network. Let Aq be its cutset- 
edge incidence matrix, and assume that for all S C Q the minimum S-separating cut ofG is unique. 
Then there is for G an edge-costs function c : E(G) — > M + , under which every mimicking network 
(G',d) satisfies \E{G')\ >rank(^ GjC ). 
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Notice that the bound is proved not for (G, c) but rather for (G, c); indeed, the edge-costs c are a 
small random perturbation of c. Thus, the proof of this lemma first shows that a small perturbation 
does not change the cutset-edge incidence matrix, i.e. Ag, c = Aq^- This is where the uniqueness 
property is used. Next, fix a small graph G 1 that can potentially be a mimicking network, but 
without specifying its edge-costs d; now let £qi be the event that (G, c) admits a mimicking network 
of the form (G',d). Since G' has too few edges (whose costs are undetermined/free variables), we 
can use linear algebra to show that Pr [£(;/] = 0. The lemma then follows by a union bound over 
the finitely many (unweighted) graphs G of the appropriate size. 



3.2 Proof of Lemma EO 



We turn to proving Lemma 3.2 Recall that this lemma considers a /c-terminal network (G,c), 
and assuming a certain (uniqueness) condition, asserts that there is for G a modified edge-costs 
function c, under which every mimicking network must have at least rank(Ac jC ) edges, where Ag }C 
is a cutset-edge incidence matrix of (G, c). 

The proof employs two lemmas and the following notation. For S C Q, let A.q C (S) > be the 
difference between the two smallest costs among all 5-separating cuts in G. Observe that if these 
two are not equal (i.e., Ag, c (S) > 0) then the minimum 5-separating cut is said to be unique in 
G. We also denote Ag jC := rains cQ ^-G,c(S). 

Lemma 3.3. For every edge-costs function w : E(G) — > [0, Ag |g(G)| ] ^ e cu t se t-edge incidence 
matrix of (G,c) is equal to the cutset-edge incidence matrix of (G,c + w), i.e. Aq, c = Ag, c +w, 
where c + w : e — >• c(e) + w(e). 

Proof. Let w be an edge-costs function w : E(G) — > [0, Ag ig(g)| ]- Since (G,c) and (G,c + w) 
have the same vertices and edges, every ^-separating cut in (G, c) is also a ^-separating cut in 
(G, c + w) and vice versa. The value of every such cutset in (G, c + w) is ranged from the value 
of this cutset in G to the value of this cutset in G plus • In particular, mincutG jC (5j, Si) < 

mincut g^ c+w (Si, Si) < mmcutG, c (Si, Si) + a^T - Thus, mmcutG, c +w{Si, Si) is smaller (by at least 
A ^ 1 ) than every cut that separates between Si and Si in G. Therefore it must be the case that 
the cutsets of the minimum S^-separating cuts in (G, c) and in (G, c + w) are the same. □ 



We proceed with the proof of Lemma 3.2. Sample an edge-costs function w : E(G) — > 
[0, a g LE(G)| ] ^ independently choosing each w(e) from that range uniformly at random. By 
the above lemma, Aq^ c = Ag, c +w so i n the rest of the proof we will omit the edge-costs function 
and denote this matrix by Aq- Now we argue that every mimicking network of (G, c + w) must 
has at least r := rank(^4c) edges. Consider some network G' with |2£(Cr')| < r, and let's see if 
it can potentially be a mimicking network of (G,c + w). Notice that every edge-costs function 
c' : E(G') — > M + for this G' yields a cutset-edge incidence matrix Aqi^i of size m x (r — 1) (if 
some graph has less than r — 1 edges we can pad the irrelevant columns with zeros). Since this 
matrix has only ones and zeros in its entries, there are only 2 m ( r ~ 1 ) such matrices. The next lemma 
proves that for every fixed matrix A E {0, i} mx ( r - 1 ) i the probability that there exists a edge-costs 
function d : E(G') -> M + such that A ■ d = Aq ■ (c + w) = $g is zero. 

Lemma 3.4. Fix a matrix A S {0, l| mx ('" _1 ) j and let Wa g and Wa be the span of the columns 
of Aq and A, respectively. If each w(e) is independently sampled uniformly at random from 
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[°> A G>C \E{G)\]> then 

Pv[A G ■ (c + w) G Wa] = 0. 

w 

Proof. Without loss of generality let the first r columns of the matrix A G , {d[, . . . ,a r },be the basis 
for the space Wa g - Since rank(A) < r = rank(^4c) we get that dim(WA) < dim(W/i G ). Thus there 
must be some basis vector of Wa g , say without loss of generality a\ , that not in the subspace Wa 
and denote by c(e±) + w(ei) to be its corresponding cost. 

We will calculate the number of vectors in Wa that can be expressed as linear combination with 
the vector d{. Let /(a) = ad[ + X^[=2( c ( e «) + w(ei))ai. If there are at least two such vectors, f(a) 
and f(a') (where a, a' / 0) in Wa, then d[ will be in Wa because Wa is a subspace. So there is 
at most one a such that /(a) G Wa- 

Since each w(ej) is sampled independently from a uniform distribution over [0, a g LE(G)| b ^ e 
probability that c(ei) + to(ei) = a is 0. By independence of w{ei) for all i G [r] we can sample 



u>(ei) last which complete Lemma 3.4 □ 



To complete the proof of Lemma |3.2[ we will calculate the probability that there exists a 
mimicking network (G' , c') for the network {G, c + w), such that |S(G")| < r. 

Pr^p mimicking network (G ,c ) with \E(G')\ < r] 

= Yt w [3A g , )C , g {0, l} mx ( r - 1 ' such that A&j ■ d = A G ■ (c + w)] 

< Pt w [3A g , >c , g {0, i} mx ('- 1 ) such that A G ■ (c + w) G W Ag ,J 

< EAe{o,i} mx ( r - 1) Pr ^[^G • (c + tZ?) G Wa] = 0, 

where the first equality is by definition of mimicking network, the following inequality is because 
the condition is necessary (but not sufficient), the second inequality is by a union bound over all 



possible matrices, and the final equality is by Lemma 3.4 Denoting c = c + w, we see that every 



mimicking network (G', c') for the network (G, c) has at least rank(Ac) edges. Lemma 3.2 follows 



3.3 Lower bound for general graphs 



We now prove Theorem |1.3| which asserts that for every k there exists a /c-terminal network that 
its mimicking network must have non-terminals. The proof constructs a bipartite /c-terminal 
network, with all its terminals on one side and all its non-terminals on the other side. As we will 
show, the rank of its cutset-edge incidence matrix is at least 2^ k \ and the corresponding cuts are 



unique, hence applying Lemma 3.2 to this matrix will complete the proof of Theorem 1.3 



Proof of Theorem \l.S\ Consider a complete bipartite graph G = (Q, U, E), where one side of the 
graph consist of the k terminals Q = {q\, . . . , q^}, the other side of the graph consists of I = (^2 k ) 

non-terminals U = {us^^, ■ ■ . , us t }, with S\, . . . , Si denoting the different subsets of terminals of size 
The costs of the edges of G are as follows. Every non-terminal us t is connected by edges of 
cost 1 to every terminal in Si, and by edges of cost 2 + e to every terminal in Si = Q \ Si, for 
sufficient small e > 0, in fact e = I suffices. Let c(usi,qj) denote the cost of edge (us^qj), and 
define c(u Si ,Sj) := ^2 qeSj c(u Si ,q). 

Lemma 3.5. The minimum Si-separating cut is obtained uniquely by the cut [W, V(G)\ W) where 
W = {u Sl } U Si and V(G) \ W = {u Sj : j + i} U S t . 
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Proof. First, notice that for every i G [I] the total cost of all edges incident to usv is 

C(U SI ,Q) = c(u Si ,Si) + c(u Si ,Si) = y • 1 + ^(2 + e) = y + y (7) 

Consider such a set Si, and let us calculate the minimum Sj-separating cut. Since non-terminals 
are not connected to each other, the decision is done separately for every non-terminal us by simply 
comparing the costs of the edges (tig-, S{) versus (ttg-, Si). The crucial observation is that for non- 
terminal uSi- 

2k 

c(u Sl ,Si) = \Si\-l = Y <(2 + e)\S t \= c(u Si ,Si) 
For a non-terminal usj where i 7^ j, 

2k 

c(u Sj ,Si) = \Sj n 5,| • 1 + I Si \ Sj\ ■ (2 + e) = \S t \ • 1 + \Si \ Sj\ ■ (1 + e) > y + 1 > c(u Sj ,Si) 

where the last inequality is by ^ and because we choose e such that ^ < 1. It follows that for 
every Si the minimum ^-separating cut will be {its,} U Si on one side, and {us 3 ■ j 7^ i} U Si on 
the other side, and moreover it is the unique minimizer. □ 

Lemma 3.6. Let Aq be a cutset-edge incidence matrix of G. Then rankle) > I- 

Proof. By definition, Aq is a matrix of size m x kl. Since (2^) = I < m = 2 k ~ 1 — 1, we need to 
show that I rows of Aq are linearly independent. Assume without loss of generality that the first 
I rows of Aq corresponds to the I subsets of terminals of size |/c, such that row t corresponds to 
subset St- We will prove that these I first rows of Aq are linearly independent, i.e. Ylt=i a t^Gt 
<^=^> ati = . . . = an = 0. We will focus on a column j in Aq that corresponds to some edge 
(uSi,q) where q G Sj. In order to know how the j-th column in Ac looks like, we need to know in 
which minimum cuts the edge (us t , q) participates, i.e. we go over all the rows of Aq and in each 
row t we will ask if the edge is in the cutset of the minimum Si-separating cut or not (if there is 1 
or in (A G ) t>j ). 

According to the construction of G, if q G Si, then the terminal q and the non-terminal ug 4 are 
in different sides of the minimum Sj-separating cut, and the edge (us i} q) in that cutset. For some 
subset St, where t 7^ % and q G St, the side of the minimum cut that contains the terminal q will be 
{us t } U St, and the other side that contains u^^ will be {us f : / € [/], / 7^ t} U St. Then again, the 
edge (us^q) will be in that cutset. It remain to look on some subset St, where t 7^ i and q G St- 
The cut will be the same as above, but now both of the vertices will be in one side of the cut, 
i.e. q,usi G {us f '■ f G [I], f 7^ t} U St, so the edge (us^q) will not participate in this cutset. In 
conclusion, The edge (lis^, q) participate in the cutset of the minimum Sj-separating cut, and in all 
the cutsets of the minimum St-separating cut such that St do not contains the terminal q. Hence 
we will get that the entry j (the column of Aq that corresponds to the edge (us., q)) in the vector 
Y!t=i dtA Gt is: 

C^2 a tAG t )j = ^2 a t( A c)t,j = + ^2 °t = ° ( 8 ) 

t=l t=l te[l\: qfSt 

Every two different subsets Si and S#, have at least ^k terminals in common. In particular there 
exist some terminal q contained in both of them. Looking at the entries corresponding to (us t , q) 
and (us., , q) in the vector Ylh=i a h^Gh we have 
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OLi + at : 

te[«]: g^s t 
a*' + ^ at 

te[l]: q$St 







Thus on = on' f° r every G [Z]. So we get the equation 
vector equation Ylt=i a t^Gt — 0> an d Lemma 



fc-i 



3.6 



follows. 



lai = in every entry in the 

□ 



We can now complete the proof of Theorem |1.3| Applying Lemma 3.2 to our bipartite graph 
G and its cutset-edge incidence matrix Aq, we get that every mimicking network G of G has at 
least I = 2 n ^ edges. It follows that \V(G')\ > y/\E(G')\ > 2 n( - k l □ 



3.4 Lower bound for planar graphs 



In this section we prove Theorem |1,4| which shows a planar fc-terminal network, every mimicking 
network of which must have at least k 2 edges. The proof constructs a grid of size 0(k 2 ) with 2k 



terminals, and applies Lemma 3.2 on graph's cutset-edge incidence matrix. 



Proof of Theorem 1.4 Construct a planar 2/c-terminal network G with 2k terminals Q = {vi, . . . , Vk, 
hi,... , /ifc} as follows. Consider a grid with k columns and k rows. Let Uij be the non-terminal 
vertex at the zth column and jth row of the grid. To every vertex uij, for 1 < j < k, we attach 
a terminal vertex Vj of degree one, and at every vertex un, for 1 < i < k, we attach a terminal 
vertex hi of degree one. Prom now on, we will refer to i and j as indices between 1 to k, including 
1, excluding k. 

The costs associated with the edges of G are as follows: every edge that connects between a 
terminal to a non-terminal costs k 4 . The cost of all the edges between the vertices Uj & and itj+i 
and between the vertices Ukj and Ukj+i, is k 4 . All the remaining vertical edges will have cost 1, i.e. 
all the edges between m t j and Uj+ij. All the remaining horizontal edges, i.e. every edge between 
Uij and Mjj+i, will cost 1 — Ei.j, where £ij = -h. Notice that for every k > 2 the sum of all the £jj 
in G is 



k - 1 1 k 2k 3 

ij=l i,j=l 

Denote by the subset of the terminals {/ti, . . . , /ij, «i, . . . , Vj}. We are interested in all the 
(k — l) 2 minimum Sjj-separating cuts. See the grid G in Figure |4j 

Lemma 3.7. The minimum Si j -separating cut is obtained uniquely by the cut (W, V(G)\W) where 
W = Sij U {u a ^ :l<a<i,l'<P<j}. 

Proof of Lemma \37}\ Let Cjj be the cost of the Si ^-separating cut (W, V(G) \ W) described in the 
lemma. By a simple calculation, Cj j = i + j — Y^a=i £a ,j- Assume towards contradiction that the 
above cut (W,V(G) \ W) is not the minimum Sjj-separating cut in G, i.e. imncutQ(Si t j, Si t j) < 
Cij < k. Thus all the edges that are contained in mincutc(S'jj, Si j) have costs less then k. In 
particular, the edges with cost k are not contained in mmcutciSij , Sij), so the two terminals vu 
and hk are connected (which means, not disconnected when we remove that cutset). 
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mincut G (S 23 ,S 23 ) / — 

/ 

/ © 




Figure 4: The 2/c-terminal network, which used in Theorem 1.4, with minimum S^-separating 
cut (the red dashed line). All the vertical and horizontal bold edges has cost k 4 , the remaining 
horizontal edges has cost 1 — £ij and all the remaining vertical edges has cost 1. 



The cut (W, V{G) \ W) contains i horizontal edges and j vertical edges. This is the minimal 
number of vertical and horizontal edges that need to be removed in the minimum cut in order to 
separate Sij from Sij. Otherwise, if we remove less then i horizontal edges, there must be some 
terminal, h a , in Sij, such that no horizontal edges were removed from its row, thus h a connected 
to the terminals Vk and that in Sij. The argument for j vertical edges is similar. 

Another observation is that the total cost of every i + j + 1 or more edges in G (with cost less 
then k A ) is not less than i + j + 1 — s a ^ > i + j, where the inequality is by Equation (|9). 

We conclude that the minimum cut has exactly j vertical edges and i horizontal edges. 

By now we know that the cutset mmcutQ^Sij,Sij) contains i horizontal edges and j vertical 
edges. Furthermore, we know that the cutset (W, V(G) \ W) contains the first i horizontal edges 
between the jth column to the (j + l)st column , and the first j vertical edges between the ith row 
to the (i + l)st row. Thus, mmcutciSij , Sij) must contains at least one different edge than the 
cut (W, V(G) \ W). There are two cases: 

1. If mincutG(<Sij, Sij) contains at least one vertical edge on some column (3 > j, then it contains 
no more than j — 1 vertical edges from the columns between 1 to j. As before, there exist 
some terminal that is connected to at least one terminal from Sij. The same argument works 
for horizontal edge that removed from row a > i. Hence, this case is impossible. 

2. If all the edges that participate in mmcutciSij, Sij) are from the first i rows and first j 
columns. We will calculate the minimal value of a cut that we can obtain. As mentioned 
above, in order to separate we need to remove one edge from every column and from every 
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row. The cost of all the vertical edges is identical so already need to pay j. Notice that in 
every row a the following inequality chain holds 



£q,1 < £q,2 < • • • < £a,k-l 

Therefore, the cost of the cheapest edge that we can take from that row is 1 — £ a ,j- Summing 
all these costs we get j + Y^ a =i(^ ~ £ a,j)- 

From the second case we get that mincutG^ij, S^j) = Cij, and that the cut (W, V(G) \ W) is the 
only cut with that value as we wanted. □ 



Proceeding with the proof of Theorem 1.4 let Aq be a cutset-edge incidence matrix of G (see 



Definition 3.1 ). 



Lemma 3.8. rank(^ G ) > (k - l) 2 

Proof of Lemma \3.8[ Assume without loss of generality that the first (k — l) 2 columns of Aq cor- 
respond to all the horizontal edges that their cost involve an e^j variable. We will order them 
according to their order in the grid from left to right, up to down. i.e. the first (k — l) 2 columns 
of Aq will correspond to the edge costs in the following order: 

1 — £1,1 , • ■ ■ , 1 — £l,fc-l , 1 — £2,1 , • • • , 1 — £2,fc-l , • • • , 1 — Efe-1,1 , • • • j 1 — £fc-l,fc-l 

In addition, without loss of generality the first (k — l) 2 rows of Aq correspond to the [k — l) 2 
minimum Sjj-separating cuts in G which deals with the (k — l) 2 subsets of terminals we are 
interested in according to the following order: 



"Sj.,1 , • • • , Si k-i , S2.1 , • • • , S2k-i > • • ■ j £fc-i,i , • • • j Sk- 



l.k-l 



We will show that the sub matrix of Aq formed by first (k — l) 2 rows and columns of Aq is a 
lower triangular matrix, which imply that the first (A; — l) 2 columns are linearly independent. Given 
column t that corresponds to 1 — e^, we need to show that the entry t, t is 1, and all the t — 1 first 
entries are 0. As we set above, the t-th row of Aq corresponds to the minimum S'jj-separating cut. 
According to Lemma |3.7| the total costs of the horizontal edges that participate in the minimum 
Sij-separating cut is Yla=i(^ ~ £ a,j)- Thus it is clear that entry t,t is 1, because the edge 1 — Eij 
participates in the minimum Sjj-separating cut. It remains to show that all the t — 1 first entries 
are 0. All the first t — 1 rows correspond to subsets of terminals S a> p such that a < i or a = i and 
P < j. As we saw above, the edge 1 — Sjj participates only in all the minimum cuts of the subsets 
S a j where a > i. Thus, there is in all the first t — 1 entries in the t-th column. So we prove that 
the first (k — l) 2 rows and columns of Aq form a lower triangular matrix as we wanted, and the 
Lemma follows. □ 



To complete the proof of Theorem 1.4, we apply Lemma 3.2 to our grid G and its cutset- 
edge incidence matrix Aq. We get that there exists an edge-costs function for G such that every 
mimicking network of G has at least rank^g) = Q(k 2 ) edges and the theorem follows. 

□ 
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4 Lower Bounds for Data Structures 



We can extend the definition of a (deterministic) TC scheme to a randomized one by letting the two 
operations access a common source of random bits. (We do not assume the random bits are stored 
explicitly in M, even though it might be required in some implementations.) We then change the 
requirement from the query operation to be 

Pt[Q(S;M) = mincut GiC (5, S)} > 2/3, 

where the probability is taken over the data structure's random bits. Our lower bound in Theorem 



1.6 holds also for randomized schemes, even those with shared randomness (that is not stored 



explicitly) . 



We now prove Theorem 1.6, which asserts that a terminal-cuts scheme requires 2 n ( fe ) words in 



the worst-case. Fix k and let (G, c) be the A:-terminal bipartite graph constructed in Section 3.3 
Recall that I := ( 2 ^ 3 ) is the number of subsets of terminals of size 2k/3, each corresponding to a 

non-terminal in G. The number of vertices in G is n := k + l = 2 e W, and size of a machine word 
is O(logn) = Q(k) bits. Assume towards contradiction there is a terminal-cuts scheme that can 
handle every fc-terminal network using less than Z/100 bits. For now, let us assume the scheme is 
deterministic. 



Let Ag, c be the cutset-edge incidence matrix of (G,c). By Lemma 3.6, rank(^4c jC ) > I. Let us 
assume that the first I columns of Aq jC are linearly independent (otherwise, we just reorder them), 
and let ej denote the edge of G corresponding to the j-th column of Aq jC . 

Let W denote the collection of 2 l edge-costs functions w : E(G) — > {0, ahrj} satisfying that 



w(ej) = for all j > I. As in Section 3.3, every function w € W defines a graph (G, c + w), whose 



cutset-edge incidence matrix is denoted Aq^+w We can now apply Lemma 3.3, since 6k > Ag jC 
and |-E(G)| = kl, and obtain that for all w £ W the network (G,c + w) has the same cutset-edge 
incidence matrix as (G, c), i.e. Aq^ = Aq^+w Using the above bound on the rank of Ag, c we can 
deduce that for every two different functions w ^ w' £ W, we have Aq <c ■ (c + w) ^ Ag, c ■ (c + w'), 
i.e. there exists S C Q such that mxacuta, c+w(S, S) ^ mincutG jC+w ' (5, S). 

Now, the assumed terminal-cuts scheme uses less than I /WO bits, and thus, by the pigeonhole 
principle, there must be w ^ w' E W, whose preprocessing results with the exact same memory 
image M = P(G,c + w) = P(G, c + w'). Consequently, for all queries S C Q, the scheme will report 
the same answer under inputs (G, c + w) and [G, c + w'), which means that mincutG, c +u;(5'> S) = 
mincutG jC+l0 / (S, S) and is a contradiction. 

Notice that the edge costs of the graphs (G, c + w) for w £ W can be easily scaled so that 
they are all in the range {0, 1, . . . , n ^ 1 '}. We conclude that a terminals-cut scheme for k te rmin als 
requires, in the worst case, storage of at least J/ 100 ? > words. This proves Theorem 1.6 for 



O(logn) 

deterministic schemes. 



Proof for randomized schemes (sketch). The proof for randomized schemes follows the same 
outline, the main difference being that we replace the simple collision argument between w ^ w' , 
with well-known entropy (information) bounds. First, the data structure's success probability can 
be amplified to at least (say) 1 — 2 2fc , by straightforward independent repetitions, while increasing 
the storage requirement by a factor of 0(k). So assume henceforth this very high probability is the 
case. 
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Now let us choose w G W at random, which corresponds to choosing a random string of Z bits. 
Using the data structure, one can retrieve with very high probability the value mincutG,c+«>(5', S) = 
Ag,c • (c + iw). Applying a union bound over all 2 fc subsets S C Q, with very high probability one 
would retrieves correctly all these values. In this case, since the first I columns of Aq^ yield an 
invertible matrix, we could actually recover the vector w itself (with high probability). But since 
w is effectively a random string of I bits, it follows by standard entropy bounds that M must have 
at least bits, and the theorem is completed just like for a deterministic scheme. 

5 Concluding Remarks 

Define a generalized mimicking network of a /c-terminal network (G, c) to be a /j-terminal network 
{G' , d) with the same set of terminals Q, such that for all disjoint S,T C Q, the minimum cost of 
a cut separating S from T is the same, namely mincut g> , C >(S,T) = mincutc C (S, T). Although this 
definition increases the number of cuts that must be preserved, our upper bound for planar graphs 
extends to this more general definition (but with larger constants in the exponents), and the same 
is true for the upper bound for general graphs by |HKNR98) . 
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